Semi-Supervised Learning on Graphs Based on Local Label Distributions
نویسندگان
چکیده
In this work, we propose a novel approach for the semi-supervised node classification. Precisely, we propose a method which takes labels in the local neighborhood of different locality levels into consideration. Most previous approaches that tackle the problem of node classification consider nodes to be similar, if they have shared neighbors or are close to each other in the graph. Recent methods for attributed graphs additionally take attributes of the neighboring nodes into account. We argue that the labels of the neighbors bear important information and considering them helps to improve classification quality. Two nodes which are similar based on labels in their neighborhood do not need to lie close-by in the graph and may even belong to different connected components. Considering labels can improve node classification for graphs with and without node attributes. However, as we will show, existing methods cannot be adapted to consider the labels of neighboring nodes in a straightforward fashion. Therefore, we propose a new method to learn label-based node embeddings which can mirror a variety of relations between the class labels of neighboring nodes. Furthermore, we propose several network architectures which combine multiple representations of the label distribution in the neighborhood with different localities. Our experimental evaluation demonstrates that our new methods can significantly improve the prediction quality on real world data sets.
منابع مشابه
Learning from Partially Labeled Data: Unsupervised and Semi-supervised Learning on Graphs and Learning with Distribution Shifting
This thesis focuses on two fundamental machine learning problems: unsupervised learning, where no label information is available, and semi-supervised learning, where a small amount of labels are given in addition to unlabeled data. These problems arise in many real word applications, such as Web analysis and bioinformatics, where a large amount of data is available, but no or only a small amoun...
متن کاملLabel Propagation for Semi-Supervised Learning in Self-Organizing Maps
Semi-supervised learning aims at discovering spatial structures in high-dimensional input spaces when insufficient background information about clusters is available. A particulary interesting approach is based on propagation of class labels through proximity graphs. The Emergent Self-Organizing Map (ESOM) itself can be seen as such a proximity graph that is suitable for label propagation. It t...
متن کاملNonparametric Maximum Margin Similarity for Semi-Supervised Learning
1. Nonparametric Label Propagation (LP) has been proven to be effective for semi-supervised learning problems, and it predicts the labels for unlabeled data by a harmonic solution of an energy minimization problem which encourages local smoothness of the labels in accordance with the similarity graph. 2. On the other hand, the success of LP algorithms highly depends on the underlying similarity...
متن کاملBidirectional Semi-supervised Learning with Graphs
We present a machine learning task, which we call bidirectional semi-supervised learning, where label-only samples are given as well as labeled and unlabeled samples. A label-only sample contains the label information of the sample but not the feature information. Then, we propose a simple and effective graph-based method for bidirectional semisupervised learning in multi-label classification. ...
متن کاملLabel propagation based on local information with adaptive determination of number and degree of neighbor's similarity
In many practical applications of machine vision, a small number of samples are labeled and therefore, classification accuracy is low. On the other hand, labeling by humans is a very time consuming process, which requires a degree of proficiency. Semi-supervised learning algorithms may be used as a proper solution in these situations, where ε-neighborhood or k nearest neighborhood graphs are em...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- CoRR
دوره abs/1802.05563 شماره
صفحات -
تاریخ انتشار 2018